TIPR: transcription initiation pattern recognition on a genome scale

نویسندگان

  • Taj Morton
  • Weng-Keen Wong
  • Molly Megraw
چکیده

MOTIVATION The computational identification of gene transcription start sites (TSSs) can provide insights into the regulation and function of genes without performing expensive experiments, particularly in organisms with incomplete annotations. High-resolution general-purpose TSS prediction remains a challenging problem, with little recent progress on the identification and differentiation of TSSs which are arranged in different spatial patterns along the chromosome. RESULTS In this work, we present the Transcription Initiation Pattern Recognizer (TIPR), a sequence-based machine learning model that identifies TSSs with high accuracy and resolution for multiple spatial distribution patterns along the genome, including broadly distributed TSS patterns that have previously been difficult to characterize. TIPR predicts not only the locations of TSSs but also the expected spatial initiation pattern each TSS will form along the chromosome-a novel capability for TSS prediction algorithms. As spatial initiation patterns are associated with spatiotemporal expression patterns and gene function, this capability has the potential to improve gene annotations and our understanding of the regulation of transcription initiation. The high nucleotide resolution of this model locates TSSs within 10 nucleotides or less on average. AVAILABILITY AND IMPLEMENTATION Model source code is made available online at http://megraw.cgrb.oregonstate.edu/software/TIPR/. CONTACT [email protected]. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rbf Neural Network Based Human Genome Tss Identification*

Identification of functional motifs in a DNA sequence is fundamentally a statistical pattern recognition problem. This paper introduces a new algorithm for recognition of functional transcription start sites (TSSs) in human genome sequences, in which RBF neural network is adopted, and an improved heuristical method for 5-tuple feature viable construction is proposed and is implemented in two RB...

متن کامل

Histone acetylation and transcriptional regulation in the genome of Saccharomyces cerevisiae

MOTIVATION In eukaryotic genomes, histone acetylation and thereafter departure from the chromatin are essential for gene transcription initiation. Because gene transcription is tightly regulated by transcription factors, there are some speculations on the cooperation of histone acetylation and transcription factor binding. However, systematic statistical analyses of this relationship on a genom...

متن کامل

New Anti-Influenza Agents: Targeting the Virus Entry and Genome Transcription

Introduction: The emergence and spread of the pandemic H1N1 influenza virus in 2009 indicates a limitation in the strategy to control the infection, despite a long-established vaccination programme and approved antivirals. Production the proper vaccine against influenza is difficult due to the genetic recombination of virus in the event of pandemic and co-circulation of drug-resistance variants...

متن کامل

Local gradient pattern - A novel feature representation for facial expression recognition

Many researchers adopt Local Binary Pattern for pattern analysis. However, the long histogram created by Local Binary Pattern is not suitable for large-scale facial database. This paper presents a simple facial pattern descriptor for facial expression recognition. Local pattern is computed based on local gradient flow from one side to another side through the center pixel in a 3x3 pixels region...

متن کامل

Genomic Study of Replication Initiation in Human Chromosomes Reveals the Influence of Transcription Regulation and Chromatin Structure on Origin Selection

DNA replication in metazoans initiates from multiple chromosomal loci called origins. Currently, there are two methods to purify origin-centered nascent strands: lambda exonuclease digestion and anti-bromodeoxyuridine immunoprecipitation. Because both methods have unique strengths and limitations, we purified nascent strands by both methods, hybridized them independently to tiling arrays (1% ge...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 31 23  شماره 

صفحات  -

تاریخ انتشار 2015